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Abstract. We consider a Mobile Ad-hoc NETworks (MANET) formed by n nodes that move 
independently at random over a finite square region of the plane. Nodes exchange data if they 
are at distance at most r within each other, where r > is the node transmission radius. 
The flooding time is the number of time steps required to broadcast a message from a source 
node to every node of the network. Flooding time is an important measure of the speed of 
information spreading in dynamic networks. 

We derive a nearly-tight upper bound on the flooding time which is a decreasing function of the 
maximal velocity of the nodes. It turns out that, when the node velocity is "sufficiently" high, 
even if the node transmission radius r is far below the connectivity threshold, the flooding time 
does not asymptotically depend on r. This implies that flooding can be very fast even though 
every snapshot (i.e. the static random geometric graph at any fixed time) of the MANET is 
fully disconnected. Data reach all nodes quickly despite these ones use very low transmission 
power. 

Our result is the first analytical evidence of the fact that high, random node mobility strongly 
speed-up information spreading and, at the same time, let nodes save energy. 
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1 Introduction 

The impact of node mobility in data propagation is currently one of the major issues in Network 
Theory. The new trend is to consider node mobility as a resource for data forwarding rather than a 
hurdle [14, 10]. This is well-captured by the model known as opportunistic Mobile Ad-Hoc NETworks 
(opportunistic MANET), an interesting recent evolution of MANET [18, 11, 10, 22]. In opportunistic 
MANETS, mobile nodes arc enabled to communicate even if a route connecting them never exists. 
Furthermore, nodes are not supposed to have or acquire any knowledge of the network topology (this 
one being highly-dynamic) . Such two features make data communication in opportunistic networks 
a new challenging research topic from both foundational and practical view-points. 
Inspired by opportunistic MANET, we here consider the Evolving Graph yielded by a set of n nodes 
moving over a finite square. Two nodes can exchange data, at a given time step, if their relative 
distance at that time step is not larger than a fixed transmission radius r > 0. 
The aim of this work is to investigate the speed of data propagation in the above evolving graph. 
It is not hard to see that this study must cope with technical problems which are far to be triv- 
ial. Just to get an insight of such difficulties, we observe that classic static concepts like global 
connectivity and network diameter are almost meaningless in this context. On one hand, we can 
easily construct a sequence of node configurations such that every corresponding snapshot (i.e. the 
communication graph at any fixed time step) of the network is not connected while the broadcast 
task can be performed in a logarithmic number of time steps. On the other hand, it is easy to 
construct another temporal sequence of node configurations where the snapshot diameter is always 
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3 while the broadcast task requires 0(n) time. Previous experimental works in this topic show that 
data communication can benefit from node mobility even though all the snapshots of the MANETs 
arc not connected [11,18,21,14,6]. However, to our knowledge, the only analytical evidence of this 
phenomenon is that proved in [11] concerning network capacity, an information-theoretic concept 
which is not known to be related to the speed of data propagation. 

We thus believe that, in order to investigate data propagation on evolving graphs - such as MANETS 
- a different concept and/or performance measure should be adopted and investigated. 
In our opinion, a fundamental role in this dynamic world is played by the Flooding Time: this 
is in fact the desired crucial concept /measure. The flooding mechanism is the simple broadcast 
communication protocol where every informed node sends the source message at every time step (a 
node is said to be informed if it knows the source message) . The flooding time of an evolving graph 
is the first time step in which all nodes are informed. The flooding time is a natural lower bound 
for any broadcast protocol and it represents the maximal speed of data propagation: the same role 
of the diameter in static networks. Flooding time of some classes of Markovian Evolving Graphs [3] 
has been recently studied in [7, 8]. 

Our work provides an analytical study of the flooding time of a natural and relevant class of 
MANETS: we prove a nearly-tight bound on the flooding time showing that high, random node 
mobility can dramatically speed-up data propagation with respect to the corresponding static model. 
This can be seen as a strong and somewhat final improvement of our previous work [8] where we 
(only) proved that (low) random node mobility does not slow-down data propagation. 

Our MANET model: An informal definition. We consider a model of mobile networks called 
geometric Markovian Evolving Graphs, i.e., geometric-MEG [8]. It is the discrete version of the 
well-known random-walk model [5, 9, 12] and can also be viewed as the walkers model [9] on the 
square. 

In geometric-MEG, nodes (i.e. radio stations) move over a finite square region of the plane and each 
node performs, independently from the others, a sort of Brownian motion. In our model we make 
time and space discrete (see Section 2 for details). The mobility parameter is the move radius p. At 
every time step, a node moves uniformly at random to any point which is within distance p from 
its current position; so p determines the maximal node velocity. At any time there is an edge (i.e. a 
bidirectional link) between two nodes if they are at distance at most r. 

It turns out that a Geometric-MEG is a temporal sequence of random disk graphs (random geometric 
graphs [19]). When r is over the connectivity threshold, such random graphs are with high probability 3 
(in short, w.h.p.) connected and have diameter D < n that depends on n and r. In our previous 
work [8], we proved that when p < r (so we are in the case of low node mobility) and the latter is 
over the connectivity threshold, then the flooding time of geometric-MEG is w.h.p. asymptotically 
equivalent 4 to the diameter of the corresponding snapshots. Under the assumption p < r, it is not 
hard to show [8] this is the best the flooding can achieve: the result is in fact asymptotically tight. It 
thus follows that, in the slow case, random node mobility does not significantly affect the flooding 
time with respect to the static case. 

Our results. In this work, we consider the case p > r. This can viewed as a model for opportunistic 
MANETS where message transmission is very difficult due to critic environment conditions and/or 
poor node transmission power while node velocity is high and random. We are thus motivated by 
the futuristic scenario of mobile wireless sensor environments composed of myriads of tiny nodes 
dispersed in the environment and subject to high, unpredictable mobility as a consequence of envi- 
ronmental dynamics such as wind, storms or water streams. To our knowledge, the impact of such 
a high node mobility on the speed of data propagation has never been considered, at least from a 
foundational perspective. 



3 As usual, wc say that an event £ occurs with high probability if P {£) > 1 — 1/n ' 1 '. 

4 Actually, our previous bound leaves an O(loglogn) gap in a small range of the network parameters. 



We provide a nearly-tight bound on the flooding time for such a case. Let G be a Geometric-MEG 
of n nodes over a square of edge size 5 y/n, transmission radius r and move radius p such that r > ro 
and p > cy/logn, where ro and c are sufficiently large positive constants. Then, w.h.p., flooding in 
G is completed within 



It is not hard to show that, for p ^ r, the expected flooding time is Q(^/n/p) [8], so our upper 
bound is nearly-tight and becomes tight whenever the flooding time is Q (log n). 
When the transmission radius r is over the connectivity threshold (i.e. @(\/logn)), our bound implies 
that if the move radius p is (asymptotically) higher than r, the flooding time is (asymptotically) 
smaller than the diameter of the snapshots. When r is smaller than the connectivity threshold (say, 
it is a constant r ) while p > C\/log n, then our bound implies that flooding can be efficiently 
completed despite the snapshot of every time step is formed by several connected components of 
small sizes [19, 13]. 

In general, our upper bound says that, in this case, the flooding time does not asymptotically depends 
on the transmission radius. This fact has important technological consequences in the futuristic sce- 
nario of large, high-mobile MANETS. The two major goals in MANETS are: i) guarantee good and 
fast data communication, and ii) minimize node energy-consumption (which is clearly an increasing 
function of r). It is well-known that in classic, (static or low- mobile) MANETS such two goals are 
in contrast with each other and, thus, a suitable trade-off must be determined (actually, optimizing 
this trade-off is currently a major research issue in ad-hoc networking [2, 15,20]). In particular, we 
know that [19, 13], in order to guarantee global connectivity (and thus data communication) in static 
random geometric graphs, the transmission radius must be J?(\/logn), so it must increase with the 
network size. 

In this context, our bound is a strong mathematical evidence of the fact that, when node mobility 
is relatively high and random, the two above goals are not competing anymore. We can achieve fast 
data- forwarding by using small transmission radius (so, saving node energy). More importantly, the 
transmission radius can be an absolute constant and, so, it does not need to increase as the network 
size does. The technology of node transmitter devices can be thus scalable. Observe that node 
mobility in such opportunistic networks is due to the host mobility which is often fully independent 
from sensor devices: high sensor mobility does not (necessarily) imply high energy consumption [10, 



Adopted Techniques. The bound in [8] on the flooding time for the slow case is achieved thanks 
to the expanding properties of the connected snapshots of the geometric-MEG which are, in turn, 
guaranteed by two facts: 1) The stationary node distribution at every time step is almost uniform; 
and 2) the transmission radius r is over the connectivity threshold. In particular, they imply that, 
starting from the second time step, the number of informed nodes is large enough to apply stan- 
dard Chcrnoff-like bounds. This allowed us to evaluate the number of new informed nodes at any 
successive step. The role of node mobility is thus shown to have a negligible impact on the flooding 
process. 

This scenario is no longer true when r is below the connectivity threshold (say constant). The 
snapshots of the geometric MEG are very sparse and disconnected and, hence, their expanding 
properties are very scarce. This results into a relatively-long initial phase (called Bootstrap) of the 
flooding process where the number of informed nodes is not large enough to get useful concentration 
results from Chernoff-like bounds. 

When p » r, the flooding process is mainly due to node mobility that, roughly speaking, brings 
the source information outside the small connected components of the sparse snapshots: We provide 
a clean analytical description of this phenomenon. A key-ingredient here is a probabilistic analysis 
of the Bootstrap. We present a set of probabilistic lemmas for almost-increasing random processes 
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time steps. 



18]. 
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that allows us to evaluate, at every time step, the number of new informed nodes even when the 
latter has a small expected value. The rather general form of such lemmas might result useful in 
other similar situations where Chcrnoff 's-likc bounds are useless. 



2 The Node Mobility Model 

We consider a model of dynamic graphs, introduced in [8], that is a discrete version of the random 
walk mobility model for radio networks [5]. In the latter model, nodes (i.e. radio stations) move on a 
bounded region of the plane (typically a square region) and each node performs, independently from 
the others, a sort of Brownian motion. At any time there is an edge (i.e. a bidirectional connection 
link) between two nodes if they are at distance at most r (r represents the transmission radius). 
In our model we discretize time and space. We choose to keep the density constant (i.e. the ratio 
between the number of nodes and the area) as the number n of nodes grows. The region in which 
nodes move is a square of side yfn and the density equals to 1. We remark that this choice is only for 
clarity's sake, and all our results can be scaled to any density 5{n). The nodes can assume positions 
whose coordinates are integer multiple of a resolution coefficient e > 0. Formally, nodes move on the 
following set of points 



L n ,e = j(*e,je) I i,j £ N A i,j s$ ^ j 



At any time step, a node can move to one of the positions of L n>€ within distance p from the 
previous position. The positive real number p is a fixed parameter that we call move radius. It 
can be interpreted as the maximum velocity of a node 6 . Formally, we introduce the move graph 
M n ^ e = (L„, e ,.E„,p, e ), where 

E n , P ,e = {(x,y) | x,y e L„, £ d(x,y) < p} 

and d(-, •) is the Euclidean distance. A node in position x, in one time step, can move in any 
position in -T(x), where P(x) = {y | (x, y) G E niPi€ }. The nodes are identified by the first n positive 
integers [n] . The time-evolution of the movement of a single node i is represented by a Markov chain 
{Pi,t ; t € N} where Pj )4 are random variables (in short r.v.) whose state-space is L n>€ and 



P (Pu +1 = x) 



otherwise 



In other words, P^t is the position of node i at time t. Thus, the time evolution of the movements 
of all the nodes is represented by a Markov chain V(n 7 p,e) = {P t : t e N} whose state space is 
L n ,e x in,e x • • • x L n f (n times) and 

Pt = {Pl.t, P2,t, ■ ■ ■ , Pn,t) 

Let us fix a transmission radius r > 0. A geometric-MEG is a sequence of random variables 
G(n, p, r, e) = {G t : t G N} such that G t = ([n], E t ) with 

E t = {(i,j)\d(P i , t ,P j , t )^r} 

As for the stationary case, we observe that the stationary distribution 7Tj of Markov chain {P ijt ; t € 
N} is (sec [1]) 

wx) - !£W| 

W ^ye Ln JP(y)\ 

Moreover, the stationary distribution of V(n,p,e) is the product of the independent distributions 
TTi for all i € [n\. We say that a geometric-MEG Q(n,p,r,e) — {G t : t € N} is a stationary 



Indeed, a node can run through a distance of at most p in a unit of time. 



geometric-MEG if the underlying Pq is random with the stationary distribution of the Markov chain 
V(n,p,e) = {P t ■ t e N}. Notice that if Q(n,p,r,e) — {G t ■ t <G N} is a stationary geometric- 
MEG then all r.v. G t are random with the same probability distribution that we call stationary 
distribution of G(n, p, r, e). 

In the rest of the paper, we will always assume that the move radius p is not larger than ^/n. 
3 Bounding the Flooding Time 

In the flooding mechanism, every node that is informed sends the source message at every time step: 
so, all nodes that are within distance r from an informed node will be informed at the next time 
step. For the sake of simplicity, every time step is divided into two consecutive actions: i) the move 
action, where nodes make their random move, and ii) the transmission action, where the informed 
nodes send the source message. Clearly this assumption does not affect the asymptotical bound on 
the flooding time. 

Our result can be formally stated as follows. 

Theorem 1. Let Q(n,p,r,e) be a stationary geometric-MEG. If r ^ r and p ^ c^/log n for suffi- 
ciently large constants r and c, then the flooding time is w.h.p. 



3.1 Proof's Overiew 

The proof consists of a probabilistic analysis of the number of new informed nodes at every time 
step of the flooding process. In order to cope with this analysis, the temporal process is organized 
in three consecutive phases. Even though it is likely that in the real process these phases happen 
simultaneously rather than consecutively, our analysis yields the desired upper bound. The phases 
depend on the current number of informed nodes and on the "locality-degree" of the process. As 
for the latter, we need to partition the square into equal supercells, i.e., subsquarcs of side length 
L = 0{p 2 ). This partition guarantees that any node v in a supercell S, after the move-action, can 
reach any position in any neighboring supercell with almost-uniform probability. Another crucial 
property yielded by the partition is that, for the first - say - 0{n) time steps, every supercell will 
contain 0(p 2 ) nodes, w.h.p. 

The Bootstrap Phase. In this initial phase, we start our analysis focussing on what happens inside 
the neighborhood of the supercell So containing the source, i.e., the supercell set N(Sq) formed by 
So and its adjacent supercells. We can say that, with positive-constant probability, So contains 
(9(r 2 ) informed nodes after the first time step. Observe that this is the crucial analysis point where 
we need to go from positive-constant probability to high probability and we cannot use Chernoff-likc 
bounds. Indeed, in the successive time steps t > of this phase, we consider the flooding-rate inside 
the supercell S t ' having the maximal number of informed nodes at time step t. We will then prove 
that, after t = O(logn) time steps, w.h.p., there will be (at least) one supercell quasi-informed, i.e., 
it will have 0(p 2 ) informed nodes. 

The Spreading Phase. After the Bootstrap, we can thus assume (w.h.p.) that there is (at least) 
one supercell quasi-informed. We can thus look at the flooding from a quasi-informed supercell to 
its adjacent ones. We show that, w.h.p., if a supercell is quasi-informed at a given time step, then 
all its adjacent supercells will be quasi-informed within the next time step. Since we prove that the 
boundary of any supercell set D has size at least fi{y/\D\), it turns out that this flooding phase 
makes all the supercells quasi- informed within 0(y/n/p) time steps. 

The Filling Phase. At the end of the previous phase, we thus have w.h.p. all supercells quasi- 
informed. The Filling phase consists of the sequence of time steps required to get all supercells 
informed. We prove that, w.h.p., this final process can be completed in O(logn) time steps. 




3.2 Preliminaries 



We need to introduce the following notions. 
— The square is partitioned into squared supercells of side length L with 

9 9 



3V2 2V2 

— Every supercell is partitioned into squared cells of side length I with 

r r 



1 + V2 V2 

- The neighborhood N(S) of a supercell S is the set of supercells formed by S and all its adjacent 
supercells. 

— A supercell is quasi-informed at time t if it contains 7p 2 informed nodes at that time, where 7 
is a suitable positive constant that will be specified later. 

We say that the density condition holds at time t if, for every supercell S, the number of nodes in S 
at time t is at least np 2 , for a suitable constant < 77 < 1. Let V be the following event: the density 
condition holds for every time step t = 0, 1, . . . , n. 

The proof of the following lemma is omitted since it is an easy consequence of the almost uniformity 
of the stationary distribution of geometric-MEG. 



Lemma 1. Let Q{n 1 p,r, e) be a stationary geometric-MEG. If p ^ c-^/logn for a sufficiently large 
constant c, then the probability of event V is at least 1 — 1/n 4 . 

In the rest of this section, we will tacitely assume that event V holds. Thanks to the previous 
lemma, since we are conditioning w.r.t. an event that holds w.h.p., the corresponding unconditional 
probabilities are affected by only a negligible factor. 

For the sake of simplicity, we will use the following probability notations. For an event £ and a r.v. 
X, the notation 

P(£\X) ^ P 
means that, for every possible value x of X, it holds 

P(£ I X = x) < p 



3.3 The bootstrap 

We now provide an upper bound on the time required to get at least one supercell quasi-informed. 
We will prove the bound for r = r where ro is a sufficiently large constant. Observe that, since the 
flooding time is a non-increasing function of the transmission radius r, the same upper bound holds 
for any r 5^ ro as well. 

The following lemma will be used to evaluate the number of new informed nodes after an initial 
sequence of consecutive time steps. Notice that the r.v. X t s can be mutually dependent. 

Lemma 2 (Almost-increasing random processes). Let {X t : teN} be a sequence of random 
variables with X = 1. Assume that two real values a > 1, < f3 < 1, a positive integer M , and a 
probability p G (0, 1) exist such that for every t £ N it holds that 

P (X t+1 < aX t I X t < M, X t - U . . . ,X X ) < p (1) 

P(X t+1 >f3X t \X t ,X t _ u ...,X 1 ) = l (2) 

VP < e logW/3) theU f° T an V 1 > logc-eplogWffl ** hMs that 



Proof. For every t define the binary r.v. Y t whose value is 1 if X t < aX t -i A X t -i < M and 
otherwise. Let us say that a time step t is bad if Yt = 1 and good if Y t = 0. By hypothesis, we have 



P (Y t = 1 | X t -! < M, X t _ 2 , . . . ,X X ) < p 

Consider the first t time steps and observe that if there are r bad time steps and t — r good time 
steps, then either X t > a t ~ T [i r or a time step i ^ t exists such that ^ M. So if < M for 
every i = 1, . . . , t, then the number of bad time steps must be at least r t where 

t log a — log M 

n = 



log(a//3) 



Hence, if t ^ i logM it holds that 

' ^ log a — e p log(a/p) 



p yv** < m >J < p ^ > T *J ^ p > r *) ^ ex p (-p*) 

where in the second inequality we used Lemma 15, in the last inequality Observation 14, and the 
fact that the hypothesis on p implies that r t ^ ept. □ 
The following lemmas allow us to apply the previous lemma to the flooding process. 
We say that a cell C is infected at time t if, immediately after the move action of time step t, C 
contains at least one informed node. The next lemma will be used in the Filling phase as well. 

Lemma 3. Positive constants a and b exist such that, for any time step t and for any supercell S , 
if at the beginning of time step t a supercell S 1 € N(S) contains m informed nodes, then 



P (Z < am 1 ) ^ exp {-bm') 



where Z is the random variable counting the number of infected cells of S at time t and m! 



min{m, L 2 /£ 2 }. 



Proof. Let S' be a supercell in N(S), let / be the set of informed nodes in S', and set m = |/|. 
For any cell C of S, let Zq be the — 1 random variable that has value 1 iff immediately after the 
move action of time step t at least one of the nodes in / is in C. Let Z' — ^2 C Zc- Clearly, it holds 
that Z ^ Z' . Since the Zc's are not independent, we will use the method of bounded differences 
(Lemma 16) in order to prove that Z' is concentrated around its expected value. Firstly, observe 
that 



P(Z c = 0) < 1 ^ 

For the sake of convenience let A = (1 — ^ ) . It holds that 

c 

Let indicate the nodes in I by the integers 1,2, ... ,m. For any i £ I, let Xi be the random variable 
whose value is the position of node i immediately after the move action (at time step t). Let 
/ • YiiLi L n ,e — > K be the function such that /(xi, . . . , x TO ) is the number of cells C of 5* such that, 
for some i, position Xj is in C. Clearly, Z' is equal to the random variable f(X\, . . . , X m ). Consider 
any integer k with 1 ^ k ^ m and any positions Xi, . . . , Xfc_i,Xfc,x' fe e L n ^. Let X\, . . . , X m be the 
random variables such that for i = 1, . . . , k Xi = x^ and for j = k + 1, . . . , m Xj = Xj. Moreover, 
define random variables Y and Y' as follows 

Y = f(Xi,...,X m ) and Y' = f(Xi,...,X k -i,x.' k ,X k+ i,...,X m ). 



It is immediate to see that 

E[F] = E [/(Xl.., X m ) I {X u ...,X k - 1 ) = (xi,...,x fc _i),Xfc=x fc ] 
E[F'] = E [f(Xi . . .,X m ) I (Xi, . . . , = (xi, . . . ,x fe _i),Xfe = x' fe ] 

Thus, we have to bound |E [Y] — E [Y'] |. Let £ and C be the number of cells of S that contain some 
position in {xi, . . . ,Xfc_i,Xfc} and {xi, . . . , Xfc_i, x' fe }, respectively. It is easy to see that if ( = (' 
then E [Y] = E [Y'\ and so |E [Y] - E [Y'\ \ = 0. If, instead, C C then it must be the case that 
|C — C'l = 1- Without loss of generality, suppose that ( = (' + 1. Consider now the analogue of 
variables Zq for the variables X\, . . . , X m and X\, . . . , Xk-i, x' fc , Xk+i, ■ ■ ■ , X m . Call them Zq and 
Z' c , respectively. Clearly, E [Y] = J2c %c and E [Y'] = J2c^'c- Sincc C = C + 1 , there exists a 
cell C of S such that C does not contain any position in {xi, . . . , Xfc-i, x^,}, it contains x^, and for 
every other cell C ^ C of S Zc = Z' c . This implies that 

E [Y] - E [Y'] = P (Z d = l) - P (Z' d = l) = 1 - P (Z' 6 = l) . 

Moreover, it holds that 

i - p (z' e = l) = p (i^ = o) ==: x rn - k . 

We have thus proved that, in any case, it holds that 

|E [/(Xl.., X m ) | (X 1 ,...,X k _ 1 ) = (xi,...,x fc _i),Xfc=x fc ] 
-E[/(X!...,X m )| (X 1 ,...,X fe _ 1 ) = (x 1 ,...,x fe _ 1 ),X fe =x' fe ]| < A 

e\z'] 

By applying Lemma 16 with S = 2 ' , we obtain that 



rn—k 



< P^'-E[Z']|^^1) ^ 2exp^- 



2 y ^ V 1 "l-J'" 2 ; — - ^ 2£™ =1 (A 

Since E \Z'\ ^ jtr(l — A" 1 ), by simple calcutations, we get 

E [Zf p 2 (l - \ m ) 

2£™ =1 (A™- fc ) 2 ^ 4072 

Moreover, since m > m! and m! < 73-, it holds that 



This implies that 



Finally, we have that 



(»-*-)> 5^ 



E[Z'] 2 to' : E[Z'] 2 . »/' 



^ ^tt^ and — - — > 



2j]™ =1 (A m - fe ) 2 25586 2 ^ 227 

P(Z^ am') s$ P (Z ^ am') ^ cxp (-6m') 



where a = ^ and 6 = 25 g 86 • □ 
We can now fix the constant 7 defining quasi-informed supercells: 7 = For any supercell S, let 
mt(S) be the number of informed in 5* at time step t. 
For any time step t let Y t = max{m t (5') : S is a supercell}. 

Lemma 4. For any time step t, it holds that 

P 



(n»<»!i>i<a)<«p(-&) 



Proof. Consider an informed node, and let C be the cell containing that node immediately after the 

2 

move action at time t. The expected number of nodes in C is E [X] — I 2 > ^| and by using Chernoff 
bounds it holds that 



o \ ^ „„„ / ' o 



A" < — < exp , 

56/ P V 224 

Clearly all the nodes in cell C will be informed after the transmission action, hence 

P (Y W < 2Y, I v, < ^) < p (V, + . < I) 



□ 



Lemma 5. For an?/ supercell S and for any time step t, it holds that 

P (m t+1 (S) < 2m t (S) £| < m t (5) < 7P 2 ) < 3 cxp (- ^ 
Proof. For sake of convenience, define to be the event 



Wt = {I <mt(5)<7p2 



For every cell C in supercell S let Zc = 1 if at least one of the m* (S) informed nodes will be in cell 
C immediately after the move action of time t and Zc = otherwise. Let Z = X^ces be the 
number of cells in S infected by the rnt(S) nodes. And let 

a = e 2 z > 



(1 + V2) 2 

be the size of the infected area in S after the move action. 

Now consider a supercell S' adjacent to S. For every node u in S", at the beginning of time t, let 
X u = 1 if u will be in the infected area immediately after the move action and X u = otherwise, 
and let X = J2ueS' X u - Observe that all such nodes will be informed after the transmission action, 
hence m t +i(S) ^ X. In what follows we will evaluate the probability that X is less than 2 times 
m t (S). Since, by assumption, event V holds, we need to evaluate the following 

P(X<2m t (S) \ H t ,V) 

We first write down the above probability in a more suitable way 

P(X<2m t (S) \ H t ,V) = 

P(X <2m | m t (S) = m, T>)P (m t (S) = ra \ H t , T>) 

r 2 

2|<m<7p 2 

If we prove that for every m between r 2 /28 and jp 2 it holds that 



P(X <2m | m t (S) = m, V) 3exp 
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then the thesis follows. Let us then relate the above probability to the size of the infected area 7 

P (X < 2m | mt(S) = m, V) < P (X < 2m | m t (S) =m,V,A^am) + 

+ P (A < am | m t (S) = m,V) (3) 



7 Here and in the sequel we will repeatedly use that for three events A, B, and C we can write P (*4 | C ) ^ 
P(A | B,C)+P(B \C). 



where a is a suitable constant that we will choose later. Now we evaluate the first term of the above 
sum. For every node u in S' it holds that 

P(X U = 1 I m t (S) = m,V, A > am) ^ -Atto (4) 



Hence the conditional expectation of X is 



E [X | m t (S) — m, V, A ^ am] ^ — =m ^ r\(? — ^m = — m (5) 

np np 7r 

If the X u s were independent, by using Chernoff bound we could get 

P ( X < ^p-m m t (S) = m,T>, am) ^ exp f-^m) 

Unfortunately, since event V makes assumption on the future, r.v. X u s conditioned on V are not 
independent anymore. Anyway, this is easy to handle. Split the event V = V\ n T>2 where T>\ is the 
event "The density condition holds since time step i" and T> 2 is the event "The density condition 
holds from time step t + 1 to time step n" . Since T> holds with high probability, then events T>i\T> 
and T> 2 \T> have negligible probabilities, hence conditioning on V is almost the same as conditioning 
on T>i . Formally, we can observe that for every event A it holds that P (.4 | V ) < 2P (A \ T>\ ). 
By conditioning on T>x instead of T>, bounds (4) and (5) remains unchanged, and now we can apply 
Chernoff bound because the X u s conditioned on T>i are independent 

_ / T ^ an , . \ / an \ / an ,\ 

P < —to TO t (5) = to, Pi, A > am J < exp J < ex P V~^^: r o J 

And 

p(a<^to| m t (S) = m, X>, A ^ am) < 2 exp (-^|-rg) 
By choosing a = — , we have that 



P(X < 2m | m t (5) = to, V, A > am) < 2 exp y—^J (6) 

As for the second addend of (3), i.e. P (A < am | mt(S) = m,T>), two cases may arise. 

Case 1: rg/28 < to < i 2 /^ 2 . 
From Lemma 3, it holds that 

P(Z < am | m t (S) = m,T>) < exp (-6m) < exp (~ & ^|) 
where a and 6 are the constants fixed by Lemma 3. Hence 

P(A < am | m t (S) = m,V) = P (z < ^m \ m t {S) = m,X>) 

< P < a{l+ r f^ 2 m | m t {S) = m,V^j 

Thus, for r ^ (1 + \/2)y / a/a 1 it holds that 

p(z < a<yl+ r f^ rn | m t (S) =m,V^j < P (Z < am | m t (5) = m,2?) < exp (^ h i§^J 



Case 2: L 2 /l 2 < to < 7 p 2 . 



From Lemma 3, it holds that 



P (Z ^ aL 2 /f \m t {S) = m,V) < exp (-b- 



where a and b are the constants fixed by Lemma 3. Hence 

P (A < am | m t (S) = m,V) P (A < a^p 2 \ m t (S) = m,T>) 

P (A < 18a 7 L 2 | m t {S) = m,V) 



L 2 

= P ( Z < 18aj— | m t (S) =m,V 



Since 7 < ^§-, it holds that 



L 2 \ ( L 2 

P(Z<18a 7 — \ m t (S)=m,Vj <c;cxpf-6— 



In both cases, since L 2 /£ 2 > r 2 /28, we get 



P{A<am \ m t {S) = m,V) ^ exp [-b^j (7) 
By using (6) and (7) in (3), we obtain 

/ r 2 

P {X < 2m I m t {S) = m, V) < 3exp 

for every r§ < m < -fp 2 . □ 

Lemma 6. Within O(logn) time steps there is a quasi-informed supercell w.h.p. 

Proof. By using simple geometric arguments, if we choose (3 = 1/121, then it holds that Y t +i ^ (3Y t 
with probability one, thus satisfying Hypothesis (2) of Lemma 2. From Lemmas 4 and 5, it is 
easy to see that the r.v. Y t s satisfy Hypothesis (1) of Lemma 2 with constants a = 2, M = jp 2 
and p = 3 exp (^— b^^j. Hence, for a sufficiently large constant r , the thesis follows by applying 
Lemma 2. □ 



3.4 The spreading phase 

Lemma 7 (Local-supercell spreading). For any supercell S, if m t (S) > jp 2 then the event 
m t+ i(S") ^ 7p 2 holds for any supercell S' € N(S) with probability at least 1 — 1/n 4 . 

Proof Let S' be any supercell in N(S). Since m t (S) ^ -fp 2 > /0 2 /r 2 (remind that r is a constant 
that we can set sufficiently large), from Lemma 3, it follows that 

P 2 \ ( P 2 

Z^a-j\ < exp -6-2 

'0/ V '0 

where Z is the random variable counting the number of infected cells of S' at time t. 

2 

Let A = £ 2 Z ^ ^-Z be the size of the infected area. From the above inequality, it holds that 

P 2 \ , ( ,P 2 



A < ayj sC exp ^-6^ J (8) 

Now consider a super-cell 5"' adjacent to S' and different from S. We want to evaluate how many 
nodes from S" move to the infected area. For every node u in S" let X u be the binary r.v. whose 



value is 1 iff node u, after the move action, is in the infected area of S' . Since we are assuming 
the density event V holds, the r.v. X u 's are not independent. We thus first bound the relative 
probabilities under the weaker assumption that the density condition holds till time step t — 1: This 
makes r.v. X u } s independent. Then, we bring this bound under the density condition V. 
It is easy to see that 

1 



P [X u = 1 



7 

Since the density condition holds before the move action of time t, we get 



E 



X 



A > 



7 



3i a 



VP 
7tt 



where X is the r.v. counting the number of nodes of 5"' moving to the infected area of S' . Observe 
that X is a lower bound on the number of informed node in cell S" after the transmission action of 
time step t. By applying Chernoff 's bound, we obtain 



P X < a 



VP 
14tt 



VP 

< exp -a— 
5D7r 



Then, from (8), we get 



P [X <a 



VP 
14tt 



sC P X < a 



VP 



VP 
14tt 

2 



A > a- 



7 



P [ A < a- 



< exp ( -a— ) + exp 



1 



for a suitable choice of constant c such that p ^ c^/logn. Since 7 ^ ajf^, we obtain 

P (X > 7P 2 I V ) > (1 - ±A 2P (25) > 1 - 1 



□ 

In order to prove a bound on the number of time steps that are sufficient to guarantee (with high 
probability) that one quasi-informed supercell spreads the information to all the supercells, we need 
two lemmas. The first one will provide a bound on the number of supercells that are adjacent to an 
arbitrary set of supercells. 

Let Q be a m x m square grid, that is, Q is a square partitioned into m x m congruent sub-squares, 
called cells. For any subset B of the cells of Q, define the boundary dB of B as the set of all the 
cells that do not belong to B and that are adjacent to some cell in B: 

dB = {c I c g B A 3c' e B : d is adjacent to c}. 

Lemma 8 (Boundary size). Let Q be a m x m square grid and let B be any subset of the cells 

of Q. It holds that 

\dB\ v/minllBl,™ 2 - \B\}. 

Proof. For the sake of convenience, we say that a cell in B is a black cell and all the cells not in B 
are white cells. We say that a row of the grid Q is black if all the cells of the row are black. Similarly, 
we define a black column. Moreover, a row or a column which contains both at least one black cell 
and at least one white cell is said to be gray. Let b r and b c be, respectively, the number of black 
rows and the number of black columns. To prove the bound on \dB\ we distinguish four cases. 

b r = A b c > 1: In this case, every row is gray. This implies that every row contains at least one cell 
in dB (if the leftmost cell of the row is black then the leftmost white cell belongs to dB otherwise 
the cell immediately to the left of the leftmost black cell belongs to dB). Since the rows are 
to, it follows that \dB\ > to. It immediately derives that \dB\ ^ ^/min{|_B|, to 2 — \B\}, since 
^/min{|£?|, to 2 — \B\} < Vro 2 = to. 



b r ^ 1 A b c = 0: This case is symmetric to the previous one. 

b r > 1 A b c > 1: Without loss of generality, assume that b r ^ b c . Since b c ^ 1, there are m — b r gray 
rows. Thus, |<9i?| > m — b r . It is easy to see that the number of cells belonging to either black 
rows or black columns is to ■ b r + to ■ b c — b r ■ b c . Since these are black cells, it holds that 

\B\ ^ to ■ b r + m ■ b c — b r ■ b c = m ■ b r + b c (m, — b r ) ^ to ■ b r + b r (m — b r ). 

Now, it easy to verify that the inequality \B\ ^ to ■ b r + b r (m — b r ) implies that 

b r ^ to 



\B\ 



Hence, it holds that \dB\ > to — b r ^ to — (to — a/ to 2 — \B\) = ^Jrri 2 — \B\, and thus \dB\ ^ 



>/™n{|B|, 



\B\}. 



b r = A b c = 0: Let y r and y c be, respectively, the number of gray rows and the number of gray 
columns. Since there are neither black rows nor black columns, it must be the case that every 
black cell belongs to both a gray row and a gray column. This implies that 

Vr ■ Vc > \B\. 

Without loss of generality, assume that y r ^ y c . It follows that j/ 2 ^ \B\, and so y r ^ Since 
every gray row contains at least a cell in dB, it holds that \dB\ ^ \/\B\ ^ ^/min{|£>|, to 2 



\B\}. 
a 

The second lemma will allow us to prove an upper bound on the number of steps to get all the 
supercells quasi-informed when, in one time step, the information propagates from all the quasi- 
informed supercells to their adjacent ones. 

Lemma 9 (Spreading time I). Let K be any integer with K > 1 and let {q t \ t e N} be a 

succession of integers such that q ^ 1, for every t ^ 0, q t ^ K and q t +\ ^ qt + ^mm{q t , K — q t }. 
Then, it holds that, for every t ^ h\J~K , q t = K. 

Proof. If K ^ 2 it is immediate to see that the thesis holds, so in the sequel of the proof we assume 
that K ^ 3. Firstly observe that if q t ^ K — 1 then q t +i ^ qt + 1. This implies that an integer r > 
exists such that t = min{t | q t ^ K/2}. Now we prove, by induction on i, that, for every t with 
^ t ^ t, q t ^ For t = 0, 1, 2 it is trivially true. Assume that it is true for t ^ 2 (and t + 1 ^ r). 
Then, since <^ < K/2, it holds that 



9t+i > qt + s/qi > 



Vst (t + i) 2 



and this completes the proof by induction. From this it follows that if t > 1 then q T -\ ^ 
Thus, it is easy to verify that r ^ 2\f~K + 1. Now, we prove, by induction on t, that 



Vt : t ^ t € 



5VZ 



qt ^ K 



t 



For t = t it holds that 



if 



(5\AK - 2\[K - \) 
<,K- ± '— (since r < 2^K + 1) 



9K 



(zVK-l) 2 1 



3 9 2 y 



Assume now that it is true for t > r (and t + 1 < )• This implies that an integer 5 ^ exists 

such that 

where we set h = \5\/K] — t. Since g t > if/2, it holds that 



-h 2 - 




~h 2 ~ 


9 




9 



<7t+l ^ qt + y/K-qt = K 
From the inequality Va ^ b > \Ja — b ^ y'a — V6, it derives that 



5 + 



ft2 

9 



5 > 5- 



/j2 

9 



9 



where the last inequality (i.e. 8 ^ VS) holds since 5 is an integer. Thus, 

q t +i > K- 

Now, we prove that, for every integer j ^ 1, the following inequality holds 









9 




9 



/ 


'j 2 ' 




"j 2 ~ 




ro--i) 2 i 




9 


9 




9 



For j = 1, 2, ... , 7, it can be exhaustively proved case by case. Let j ^ 8. It holds that 

> ft > Z > 8 + 2 J 

^ V 9 ^ 3 " 9 



;2 



where the last inequality is due to the assumption that j > 8. Moreover, 

< J 2 ■ ! (J" 1 ) 2 = 8 + 2J 
^ 9 9 9 

Thus the inequality is proved. By using it with j = h, we obtain that 



-.r 




f(i - i) 2 l 


9 




9 



q t +i > K - 



-h 2 - 




"h 2 ~ 


> if- 


r(^-i) 2 i 


9 




9 




9 



K 



5v^l - (t + 1) 



and this completes the proof by induction. Thus, we have proved that for t = [~5Va| , it holds that 

2" 



> K 



if. 



Hence, for every t > |~5va] , it holds that g t = K. 

By combining Lemmas 7, 8 and 9 we get the following bound. 



□ 



Lemma 10 (Spreading time II). If at time t\ ^ n/2, there is at least one quasi-informed supercell 
then, with probability at least 1 — ^ , at every time t with t\ + 22-^ ^ t ^ n all the supercells are 
quasi-informed. 



Proof. In the sequel, when we write time t we mean time t\ + t. For any t ^ 0, let Qt be the set 
of quasi- informed supercells at time t. By hypothesis |Qo| ^ 1- In virtue of Lemma 8, if all the 
supcrcclls in Q t and all their adjacent supercells get quasi-informed, at time t+1, then 



\Qt+i\ > \Qt\ + y/mm{\Q t \,N- \Q t \} 

where N — is the number of supercells. This implies that if the above inequality does not 
hold then there exists a supercell S £ Q t such that either S or one adjacent supercell of S is not 
quasi- informed at time t+1. It follows that 

P (ifit+il < \Q t \ + V^H\Qt\,N-\Q t \}) P (3S e Qt ■■ £ s ,t+i) 

where £s,t+\ is the event that occurs if S or one adjacent supercell of S is not quasi- informed at 
time t+1. By the union bound, it holds that 

P (35 G Q t : £ s ,t+i) < ]T P (S G Q t A £ s ,t+i) = ]T P (£s,«+i | S € Q t ) P (S € Qt) . 

s s 

From Lemma 7, for every supercell S, 

P(£s.t+i \SeQ t ) < \- 

It follows that 

P(|Qt+i| < |Qt| + Vmin{|Qt|,7V-|Qt|}) ^^P(£ s ,t+i I 5 G Qt ) P (S e Qt) 

s 

<^P(£s,t+i |5eQt) 
s 

TV 1 
< — < — ' 

Thus, by the union bound, with probability at least 1 — , it holds that 



Vt = 0,l,...,n/2 |Qt+i| ^ |Qt| + Vmin{|Q t |,iV- |Q t |}. 

By applying Lemma 9 with g t = |Q t | and K — N,we obtain that, for every t with ^t^ n/2, 

\Qt\ = N. The thesis follows by taking into account that t\ ^ n/2, t means time t\ + t, and 

3.5 The filling phase 

We first prove that a node not yet informed and belonging to a quasi-informed supercell will get 
informed in one time step, with a constant probability. 

Lemma 11. There exists a constant /3 > such that, for any node u, if at the beginning of a 
time step t the supercell that contains u is quasi-informed and node u is not informed then, with 
probability at least (3, node u gets informed by the end of time step t. 

Proof. Assume that at the beginning of time step t node u is not informed and the supercell S that 
contains u is quasi-informed. Thus S contains at least 70 2 ^ informed nodes at the beginning of 
time step t. From Lemma 3, it derives that 



where Z is the random variable counting the number of infected cells of S at time t. Now, let U be 
the event that occurs if node u gets informed by the end of time step t. It is immediate to see that, 
for any k ^ 0, 

kf 2 

p(u\z = k) > 

irp z 

It follows that 

P(U) = J2 P (U \Z = k)P{Z = k) 

k^O 

> P(W \Z = k)V{Z = k) 

k^^4- 

*"o 

= fp(z^ 
67T V 7-n 



a I ( bp 

'0 



for a suitable constant (3 > 0. □ 
From the above lemma we can derive a logarithmic upper bound for the filling time. 

Lemma 12. If a time step ti ^ exists such that at every time step t with ti ^ t ^ n all the 
supercells are quasi-informed, then by time t 2 + C(logn) all the nodes are informed, w.h.p. 

Proof. In the sequel, when we write time t we mean time t 2 + t. Let u be any node. For any time 
t, let Ut be the event that occurs if node u is not informed by the end of time step t. Let (3 be the 
constant of Lemma 11 and let k be a constant such that /3 fclo s™ < By hypothesis, for every 
t = 0,1,.. . fclogn, all the supercells are quasi-informed. This implies that at each of these time 
steps the supercell that contains u is quasi- informed. Thus, from Lemma 11, it holds that 



(k log n \ k log n 

a u t = n p(«* 



3=0 



n 2 ' 



This means that a node u is not informed within fclogn time steps with probability at most 

i ^ 

Hence, a direct application of the union bound shows that, with probability at least 1 — ^, all the 
nodes are informed within fclogn steps. □ 

Finally, Theorem 1 follows from Lemma 6, Lemma 10, and Lemma 12. 



4 Conclusions 

Some interesting issues concerning the flooding time on geometric-MEG are still open. There is a 
logarithmic gap between our upper bound and the known lower bound [8] when the move radius p 
is very large. Closing this gap is an open problem. 

A more challenging open issue is to extend our upper bound in the case where p and r are both 
very small (i.e. below \f\ogn). 

Another open research work is to study the flooding time when the starting distribution is not 
the stationary one but is arbitrary (i.e. a worst-case analysis). We conjecture that the worst-case 



flooding time is not asymptotically larger than the stationary one for a large range of the network 
parameters. 

Observe that our upper bound can be easily extended to the gossiping task (i.e. the all-to-all com- 
munication). It would be interesting to extend our analysis to other basic communication tasks such 
as data-gathering and routing. 

Finally, a major challenge is to obtain results similar to our upper bounds for more realistic mobility 
models [17]. 
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A Basic Probability 

Lemma 13 (Chernoff 's bound). Let be X — Ym=i where X\, . . . , X n are independent Bernoulli 
random variables and let be < e < 1. I/O < fj, ^ E[X], then it holds 

P{X < (1 - e)ri ^ e~$» 
Observation 14 Let X be a binomial random variable with E [X] = /i. Then 

P(X ^ en) sC e - ^ 

We use the following standard probability bound (See [4]). 

Lemma 15. Let X\, . . . ,X n be a sequence of random variables with values in an arbitrary do- 
main, and let Y\,...,Y n be a sequence of binary random variables, with the property that Yi = 
Y i (X 1 ,...,X i ). If 

P(Y i = l\X 1 ,...,X i - 1 )^p 

then 

P(j2 y i> k ) <P(B(n,p)^k) 

where B(n,p) denotes the binomially distributed random variable with parameters n and p. 

The following lemma states one inequality of the method of bounded differences (see Corollary 6.10 
in [16]). 

Lemma 16. Let X\, . . . ,X n be random variables, with Xk taking values in a set A k for each k, 
and let X_ denote the vector (Xi, . . . , X n ). Let f : JJ k=1 Ak - » K be an appropriately measurable 
function. Suppose that there are constants c\, . . . , c n so that 

|E[/Q0 I (X 1 ,...,X k - 1 ) = {x 1 ,...,x k -i),X k = x k ] 
- E [f(X) | (X x , . . .,X k -i) = [x-l, . . .,x k -i),X k = x' k ] | s$ c fc 

for each k = 1, . . . ,n and Xi € A{ (i = 1, . . . , k — 1) and x k ,x' k e A k . Then for any 8 > 0, 
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P(|/(20-E[/(2Q]|^) < 2cxp(- 



